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MANIPULATION OF ORGANIC ACID BIOSYNTHESIS AND SECRETION 

The present invention relates to nucleic acid fragments encoding amino 
. acid sequences for organic acid biosynthetic enzyme polypeptides in plants, and 
the use-thereof for the modification of organic acid biosynthesis and secretion in 
5 plants. In particularly preferred embodiments, the invention relates to the 
combinatorial expression of malate dehydrogenase (MDH) and/or 
phosphoenolpyruvate carboxylase (PEPC) and/or citrate synthase (OS) in plants 
to modify organic acid biosynthesis and secretion. 

Documents cited in this specification are for reference purposes only and 
10 their inclusion is not acknowledgment that they form part of the common general 
knowledge In it^e relevant art. 

Organic acids, such as citrate and malate, are key metabolites in plants. 
They are involved in numerous processes, including 04 and Crassulacean acid 
metabolism (CAM) photosynthesis, stomatal and pulvinual movement, nutrient 

15 uptake, respiration, nitrogen assimilation, fatty acid oxidation, and providing 
energy to bacteroids in root nodules. For example, malate plays a key role in root 
nodule metabolism and nitrogen fixation, serving as the primary carbon source for 
bacteroid maintenance and nitrogenase activity, and is also tightly linked to nodule 
nitrogen assimilation. Furthermore, the complexing role of organic acids produced 

20 and excreted from plant roots has also been associated with tolerance to the 
aluminium cation Al^*^ which is toxic to many plants at mlcromolar concentrations. 
Aluminium toxicity has been recognized as a major limiting factor of plant 
productivity on acidic soils, which account for approximately 40% of the earth's 
arable land. 

25 The tricarboxylic acid cycle (TCA), also known as Krebs cycle (after its 

discoverer Hans Krebs) or citric acid cycle, moves electrons from organic acids to 
the oxidized redox cofactors NAD* and FAD, fomning NADH, FADH2, and carbon 
dioxide (CO2). The reaction sequence of the TCA cycle involves: In a reaction 
catalysed by citrate synthase (CS), acetyl-CoA formed by the pyruvate 

30 dehydrogenase complex combines with oxaloacetate to produce the Ce 
tricarboxylic acid, citrate. In the overall cycle, the citrate is oxidized to produce two 
molecules of CO2 in a series of reactions that leads to the formation of one 
oxaloacetate, three NADH, one FADH2, and one ATP. The resulting oxaloacetate 
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reacts with another molecule of acetyl-CoA to continue the cycle. The oxidative 
decarboxylation of pymvate yields an additional CO2 and NADH. Thus the TCA 
cycle brings about the complete oxidation of pyruvate to three CO2 plus 10 
electrons, which are stored temporarily as 4 NADH and 1 FADH2. 

5 Cytosollc reactions generate products that are transported into the 

mitochondria to feed the TCA cycle. The nature of the end product of the glycolytic 
reactions In the cytosol of plants is detennined by the relative activities of the three 
enzymes that can utilize phosphoenol-pyruvate (PEP) as substrate. Both pyruvate 
kinase and PEP-phosphatase form pyruvate; while PEP-carboxylase (PEPC) 
10 generates oxaloacetate. Pyruvate is transported directly into the mitochondrion. 
Oxaloacetate Is either transported directly Into the mitochondrion or first reduced 
to malate by cytosollc malate dehydrogenase (MDH). 

Before entering the TCA cycle proper, pyruvate Is oxidised and 
decarboxylated by the pyruvate dehydrogenase enzyme complex to form CO2, 
15 acetyl-CoA, and NADH. The pyruvate dehydrogenase enzyme complex, which 
requires the bound cofactors thiamine pyrophosphate, lipoic acid, and FAD as well 
as free coenzyme A (CoASH) and NAD*, links the TCA cycle to glycolysis. 

It Is known that the TCA cycle includes the following enzymes: pyruvate 
dehydrogenase, citrate synthase, citrate hydrolase, Isocitrate dehydrogenase, 
20 oxoglutarate dehydrogenase, succinyl-CoA synthetase, succinate dehydrogenase, 
fumarase, malate dehydrogenase, NAD-malic enzyme and phosphoenolpyruvate 
carboxylase. 

In particular, citrate synthase (CS) catalyses the condensation of acetyl- 
CoA and oxaloacetate to fomn the C6 molecule citrate and fi-ee CoASH, as the 
25 TCA cycle proper begins. 

Malate dehydrogenase (MDH) catalyses the final step of the TCA cycle, 
oxidizing malate to oxaloacetate and producing NADH. This reaction catalysed by 
MDH is reversible, thus allowing also for the reversible reduction of oxaloacetate 
to malate. The enzyme MDH is important in several metabolic pathways, and 
30 higher plants contain multiple forms that differ In co-enzyme specificity and 
subcellular localization. Chloroplasts contain an NADP*-dependent MDH that 
plays a critical role In balancing reducing equivalents between the cytosol and 
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stroma. Plants also contain NAD-dependent MDHs which are found in a) 
mitochondria as part of the TCA cycle; b) cytosol and peroxisomes involved in 
malate-aspartate shuttles; and c) glyoxisomes functioning in p-oxidation. In root 
nodules of nitrogen-fixing legumes, such as white clover {Trifolium repens) and 
5 alfalfa {Medicago sath/a), malate sen/es as the primary cartx>n source to support 
the respiratory needs of the bacterial microsymbiont and the fixation of Na by 
nitrogenase, and a nodule-enhanced MDH is thus critical for nodule function. 

Phosphoenolpyruvate carboxylase (PEPC) catalyses the reaction of 
phosphoenol-pyruvate with HCO3" releasing the phosphate and producing the C4 
10 product, oxaloacetate. Oxaloacetate is commonly reduced to malate by NADH 
through the action of malate dehydrogenase (MDH). PEPC is a homotetrameric 
enzyme widely distributed in most plant tissues. In plants. PEPC fulfils various 
physiological roles such as the photosynthetic CO2 fixation in C4 and 
Crassulacean Acid Metabolism (CAM) plants, and the anaplerotic pathway. 

15 While nucleic acid sequences encoding some organic acid biosynthetic 

enzymes have been isolated for certain species of plants, there remains a need for 
materials useful in modifying organic acid biosynthesis; in modifying organic acid 
secretion; in modifying phosphorus acquisition efficiency in plants; in modifying 
aluminium and acid soil tolerance in plants; In modifying nitrogen fixation and 

20 nodule function, particulariy In forage legumes and grasses, including alfalfa, 
medics, clovers, ryegrasses and fescues, and for methods for their use. 

This invention is directed towards overcoming, or at least alleviating, one or 
more of the difTiculties or deficiencies associated with the prior art. 

In one aspect, the present invention provides substantially purified or 
25 isolated nucleic acids or nucleic acid fragments encoding the organic acid 
biosynthetic polypeptides CS, MDH and PEPC, from a clover {Trifolium), medic 
(Medicago), ryegrass (Lolium) or fescue (Festuca) species, or functionally active 
fragments or variants of these polypeptides. 

The present invention also provides substantially purified or isolated nucleic 
30 acids or nucleic acid fragments encoding amino acid sequences for a class of 
polypeptides which are related to CS, MDH and PEPC (from a clover (Trifolium), 
medic (Medicago), ryegrass (Lolium) or fescue (Festuca) species) of CS, MDH 
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and PEPC, or functionally active fragments or variants of CS, MDH and PEPC, 
Such polypeptides are refen-ed to herein as CS-like, MDH-like and PEPC-like 
respectively and include polypeptides having similar functional activity. 

The present invention also relates to individual or simultaneous 
5 enhancement or otherwise manipulation of CS, MDH and/or PEPC or like gene 
activities in plants to enhance or otherwise alter organic acid biosynthesis; to 
enhance or reduce or otherwise alter organic acid secretion; to enhance or reduce 
or otherwise alter phosphorous acquisition efficiency in plants; to enhance or 
reduce or otherwise alter aluminium and acid soil tolerance in plants; and/or to 
10 enhance or reduce or otherwise alter nitrogen fixation and nodule function In 
legumes. 

The individual or simultaneous enhancement or othenwise manipulation of 
CS, MDH and/or PEPC or like gene activities in plants has significant 
consequences for a range of applications in, for example, plant production, plant 

15 performance, plant nutrition and plant tolerance. For example, it has applications 
in increasing plant tolerance to aluminium-toxic acid soils; in improving plant 
nutrient acquisition efficiency for example in increasing acquisition of phosphorus 
from soils; in increasing nodule function in nitrogen-fixing legumes for example 
leading to enhanced nitrogen fixation; In modifying the accumulation of organic 

20 acids such as citrate In fruits; In modifying the secretion of organic acids for 
example citrate and/or malate from plant roots. 

Manipulation of CS, MDH and/or PEPC or like gene activities in plants, 
including legumes such as clovers {Trifolium species), luceme {Medicago sativa) 
and grass species such as ryegrasses {Lolium species) and fescues {Festuca 
25 species) may be used to facilitate the production of, for example, forage legumes 
and forage grasses and other crops with enhanced tolerance to aluminium toxic 
soils; enhanced nutrient acquisition efficiency; forage legumes with enhanced 
nitrogen fixation; fruits with enhanced organic acid content leading to enhanced 
flavour and health benefits. 

30 The clover (Trifolium), medic {Medicago), ryegrass {Lolium) or fescue 

{Festuca) species may be of any suitable type, including white clover {Trifolium 
repens), red clover {Trifolium pratense), subterranean clover {Trifolium 
subterraneum), alfalfa {Medicago sativa), Italian or annual ryegrass {Lolium 
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muttiflorum), perennial ryegrass {Lolium perenne), tall fescue {Festuca 
arundinacea), meadow fescue {Festuca pmtensis) and red fescue {Festuca rubra). 
Preferably the species is a clover or a ryegrass, more preferably white clover (7. 
repens) or perennial ryegrass {L. perenne). White clover {Trifolium repens L.) and 
5 perennial ryegrass {Lolium perenne L.) are key pasture legumes and grasses, 
respectively, in temperate climates throughout the world. Perennial ryegrass is 
also an important turf grass. 

The nucleic acid or nucleic acid fragment may be of any suitable type and 
includes DNA (such as cDNA or genomic DNA) and RNA (such as mRNA) that is 

10 single- or double-stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases, and combinations thereof. The RNA is readily obtainable, for 
example, by transcription of a DNA sequence according to the present invention, 
to produce a RNA corresponding to the DNA sequence. The RNA may be 
synthesised in vivo or in vitro or by chemical synthesis to produce a sequence 

15 corresponding to a DNA sequence by methods well known in the art. In this 
specification, where the degree of sequence similarity between an RNA and DNA 
is such that the strand of the DNA could encode the RNA, then the RNA is said to 
"conrespond" to that DNA. 

The tenm "isolated" means that the material is removed from its original 
20 environment (eg. the natural environment if it is naturally occurring). For example, 
a naturally occurring nucleic acid or polypeptide present in a living plant is not 
isolated, but the same nucleic acid or polypeptide separated from some or all of 
the coexisting materials in the natural system, is isolated. Such an isolated nucleic 
acid could be part of a vector and/or such a nucleic acid could be part of a 
25 composition, and still be isolated in that such a vector or composition is not part of 
its natural environment. An isolated polypeptide could be part of a composition and 
still be isolated in that such a composition is not part of its natural environment. 

By 'Yunctionally active" In respect of a nucleic acid it Is meant that the 
fragment or variant is capable of modifying organic acid biosynthesis in a plant. A 
30 variant in this context can be an analogue, derivative or mutant and includes 
naturally occurring allelic variants and non-naturally occurring variants. Additions, 
deletions, substitutions and derivatizations of one or more of the nucleotides are 
contemplated so long as the modifications do not result in loss of functional activity 
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of the fragment or variant Preferably the functionally active fragment or variant 
has at least approximately 80% identity to the functional part of the above 
mentioned sequence, more preferably at least approximately 90% identity, most 
preferably at least approximately 95% identity. Such functionally active variants 
5 and fragments include, for example, those having nucleic acid changes which 
result in conservative amino acid substitutions of one or more residues in the 
corresponding amino acid sequence. Preferably the fragment has a size of at least 
30 nucleotides, more preferably at least 45 nucleotides, most preferably at least 60 
nucleotides. 

10 By "functionally active" in respect of a polypeptide it is meant that the 

fragment or variant has one or more of the biological properties of the proteins CS, 
CS-like, MDH, MDH-like, PEPC and PEPC-like. A variant in this context includes 
additions, deletions, substitutions and derivatizations of one or more of the amino 
acids are contemplated so long as the modifications do not result in loss of 

15 functional activity of the fragment or variant. Preferably the functionally active 
fragment or variant has at least approximately 60% identity to the functional part of 
the above mentioned sequence, more preferably at least approximately 80% 
identity, most preferably at least approximately 90% identity. Such functionally 
active variants and fragments include, for example, those having conservative 

20 amino acid substitutions of one or more residues in the corresponding amino acid 
sequence. Preferably the fragment has a size of at least 10 amino acids, more 
preferably at least 15 amino acids, most preferably at least 20 amino acids. 

The term "construct" as used herein refers to an artificially assembled or 
isolated nucleic acid molecule which includes the gene of interest. In general a 

25 construct may include the gene or genes of Interest, a marker gene which In some 
cases can also be the gene of interest and appropriate regulatory sequences. It 
should be appreciated that the inclusion of regulatory sequences in a constmct is 
optional, for example, such sequences may not be required in situations where the 
regulatory sequences of a host cell are to be used. The term construct includes 

30 vectors but should not be seen as being limited thereto. 

The term "vector^ as used herein encompasses both cloning and 
expression vectors. Vectors are often recombinant molecules containing nucleic 
acid molecules from several sources. 
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By "operatively linked" in respect of a regulatory element, nucleic acid or 
nucleic acid fragment and terminator, it meant that the regulatory element is 
capable of causing expression of said nucleic acid or nucleic acid fragment in a 
plant cell and said temninator is capable of tenninating expression of said nucleic 
5 acid or nucleic acid fragment in a plant cell. Preferably, said regulatory element is 
upstream of said nucleic acid or nucleic acid fragment and said temninator Is 
downstream of said nucleic acid or nucleic acid fragment. 

By "an effective amount" of a nucleic acid or nucleic acid fragment it is 
meant an amount sufficient to result in an identifiable phenotypic trait in said plant, 

10 or a plant, plant seed or other plant part derived therefrom. Such amounts can be 
readily determined by an appropriately skilled person, taking into account the type 
of plant, the route of administration and other relevant factors. Such a person will 
readily be able to determine a suitable amount and method of administration. See, 
for example, Maniatis et al. Molecular Cloning: A Laboratory Manual, Cold Spring 

15 Harbor Laboratory, Cold Spring Harbor, the entire disclosure of which is 
incorporated herein by reference. 

It will also be understood that the term "comprises" (or its grammatical 
variants) as used in this specification is equivalent to the term Includes" and 
should not be taken as excluding the presence of other elements or features. 

20 Such nucleic acids or nucleic acid fragments could be assembled to form a 

consensus contig. As used herein, the term "consensus contig" refers to a 
nucleotide sequence that is assembled from two or more constituent nucleotide 
sequences that share common or overiapping regions of sequence homology. For 
example, the nucleotide sequence of two or more nucleic acids or nucleic acid 

25 fragments can be compared and aligned in order to Identify common or 
overiapping sequences. Where common or overlapping sequences exist between 
two or more nucleic acids or nucleic acid fragments, the sequences (and thus their 
corresponding nucleic acids or nucleic acid fragments) can be assembled into a 
single contiguous nucleotide sequence. 

30 In a preferred embodiment of this aspect of the invention, the substantially 

purified or isolated nucleic acid or nucleic acid fragment encodes a CS or CS-like 
polypeptide and including a nucleotide sequence selected from the group 
consisting of (a) sequences shown in Figures 1, 3, 4. 6, 7, 9, 99. 101, 102. 104, 
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114, 118 and 122 hereto (SEQ ID NOS 1, 3 to 10, 11. 13 to 16, 17, 19, 327, 329 to 
335, 336, 338 to 344, 349, 351, 353 respectively): (b) complements of the 
sequences recited In (a); (c) sequences antlsense to the sequences recited in (a) 
and (b); (d) functionally active fragments and variants of the sequences recited in 
5 (a), (b) and (c); and (e) RNA sequences corresponding to the sequences recited in 

(a) , (b),(c)and(d). 

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated nucleic acid or nucleic acid fragment encodes a 
MDH or MDH-like polypeptide and including a nucleotide sequence selected from 

10 the group consisting of (a) sequence shown in Figures 11, 13, 14, 16, 17, 19, 21, 
23, 25, 26. 28, 30, 31, 33, 35. 37. 38. 40. 55, 57, 58. 60, 61, 63, 64, 66, 67, 69, 70, 
72, 73. 75. 76, 78, 79, 81, 82 and 84 hereto (SEQ ID NOS. 21, 23 to 29; 30, 32 to 
33, 34, 36, 38, 40, 42 to 43, 44, 46, 48 to 110, 111, 113, 115, 117 to 182, 183, 
185, 205, 207 to 217, 218, 220 to 251, 252, 254 to 270, 271, 273 to 275, 276, 278 

15 to 287, 288, 290 to 292, 293, 295 to 296. 297, 299 to 301 , 304 to 305, 306, 308); 

(b) complements of the sequences recited in (a); (c) sequences antlsense to the. 
sequences recited in (a) and (b); (d) functionally active fragments and variants of 
the sequences recited in (a), (b) and (c); and (e) RNA sequences corresponding to 
the sequences recited in (a), (b), (c) and (d). 

20 in a further preferred embodiment of this aspect of the invention, the 

substentially purified or isolated nucleic acid or nucleic acid fragment encodes a 
PEPC or PEPC-IIke polypeptide and including a nucleotide sequence selected 
firom the group consisting of (a) sequences shown in Figures 42, 44, 46, 47, 49, 
51, 53, 86, 88, 89, 91, 92, 94, 95, 97 and 110 hereto (SEQ ID NOS 187, 189, 191 

25 to 1 97, 1 99, 201 , 203, 31 0, 31 2 to 314, 31 5, 31 7 to 31 8, 31 9, 321 to 322, 323, 325 
and 347 respectively); (b) complements of the sequences recited in (a); (c) 
sequences antlsense to the sequences recited in (a) and (b); (d) functionally active 
fragments and variants of the sequences recited in (a), (b) and (c); and (e) RNA 
sequences corresponding to the sequences recited in (a), (b), (c) and (d). 

30 Nucleic acids or nucleic acid fragments encoding at least a portion of 

several OS, MDH and PEPC polypeptides have been isolated and identified. 
Genes encoding other OS or CS-like, MDH or MDH-like and PEPC or PEPC-like 
proteins, either as cDNAs or igenomic DNAs, may be isolated directly by using ail 
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or a portion of the nucleic acids or nucleic acid fragments of tlie present invention 
as hybridisation probes to screen libraries from the desired plant employing the 
methodology well known to those skilled in the art. Specific oligonucleotide probes 
based upon the nucleic acid sequences of the present invention may be designed 
5 and synthesized by methods known in the art. Moreover, the entire sequences 
may be used directly to synthesize DNA probes by methods known to the skilled 
artisan such as random primer DNA labelling, nick translation, or end-labelling 
techniques, or RNA probes using available in vitro transcription systems. In 
addition, specific primers may be designed and used to amplify a part or all of the 
10 sequences of the present invention. The resulting amplification products may be 
labelled directly during amplification reactions or labelled after amplification 
reactions, and used as probes to isolate full-length cDNA or genomic fragments 
under conditions of appropriate stringency. 

In addition, short segments of the nucleic acids or nucleic acid fragments of 

15 the present invention may be used in protocols to amplify longer nucleic acids or 
nucleic acid fragments encoding homologous genes from DNA or RNA. For 
example, polymerase chain reaction may be performed on a library of cloned 
nucleic acid fragments wherein the sequence of one primer is derived from the 
nucleic acid sequences of the present invention, and the sequence of the other 

20 primer takes advantage of the presence of the polyadenylic acid tracts to the 3' 
end of the mRNA precursor encoding plant genes. Alternatively, the second primer 
sequence may be based upon sequences derived from the cloning vector. For 
example, those skilled in the art can follow the RACE protocol (Frohman et at. 
(1988) Proc. Nati. Acad Sci. USA 85:8998, the entire disclosure of which is 

25 incorporated herein by reference) to generate cDNAs by using PGR to amplify 
copies of the region between a single point in the transcript and the 3' or 5' end. 
Using commercially available 3' RACE and 5' RACE systems (BRL), specific 3' or 
5' cDNA fragments may be isolated (Ohara et aL (1989; Proc. Natl. Acad Sci USA 
86:5673; Loh et al, (1989) Science 243:217, the entire disclosures of which are 

30 incorporated herein by reference). Products generated by the 3' and 5' RACE 
procedures may be combined to generate full-length cDNAs. 

In a further aspect of the present invention there is provided a substantially 
purified or isolated polypeptide from a clover {Trifolium), medic {Medicago)^ 
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ryegrass (Lolium) or fescue (Festuca) species, selected from the group consisting 
of CS or CS-like, MDH or MDH-iike and PEPC or PEPOIike polypeptides; and 
functionally active fragments and variants of these polypeptides. 

The clover (Trifolium), medic (Medicago), ryegrass {Lolium) or fescue 
5 (Festuca) species may be of any suitable type, including white clover (Trifolium 
repens), red clover {Trifolium pratense), subterranean clover {Trifolium 
subterraneum), alfalfa {Medicago sativa), Italian or annual ryegrass {Lolium 
multiflorum), perennial ryegrass {Lolium perenne), tall fescue {Festuca 
arundinacea), meadow fescue {Festuca pratensis) and red fescue {Festuca rubra). 
10 Preferably the species is a clover or a ryegrass, more preferably white clover (7. 
repens) or perennial ryegrass (L. perenne). 

In a preferred emt>odiment of this aspect of the invention, the substantially 
purified or isolated CS or CS-like polypeptide includes an amino acid sequence 
selected from the group consisting of sequences shown in Figures 2, 5, 8, 10, 100, 
15 103, 115, 1 19, 123 hereto (SEQ ID NOS 2, 12, 18, 20, 328, 337, 350, 352 and 354 
respectively); and functionally active fragments and variants thereof. 

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated MDH or MDH-like polypeptide includes an amino 
acid sequence selected from the group consisting of sequences shown in Figures 
20 12, 15, 18, 20, 22, 24, 27, 29. 32, 34, 36, 39, 41 , 56, 59. 62, 65, 68, 71 . 74, 77, 80, 
83 and 85 hereto (SEQ ID NOS 22, 31, 35, 37, 39, 41, 45, 47, 112, 114, 116, 184, 
186, 206, 219, 253, 272, 277, 289, 294, 297. 303. 307 and 309, respectively) and 
functionally active fragments and variants thereof. 

In a further preferred embodiment of this aspect of the invention, the 
25 substantially purified or isolated PEPC or PEPC-like polypeptide includes an 
amino acid sequence selected from the group consisting of sequences shown in 
Figures 43, 45, 48, 50, 52, 54, 87, 90, 93, 96, 98 and 111 hereto (SEQ ID NOS 
188, 190. 198, 200, 202. 204, 311, 316, 320. 324, 326, and 348 .respectively); and 
functionally active fragments and variants thereof. 

30 In a further embodiment of this aspect of the invention, there is provided a 

polypeptide produced (e.g. recombinantly) from a nucleic acid or nucleic acid 
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fragment according to the present invention. Teciiniques for recombinantly 
producing polypeptides are knowm to those skilled In the art. 

Availability of the nucleotide sequences of the present invention and 
deduced amino acid sequences facilitates immunological screening of cDNA 
5 expression libraries. Synthetic peptides representing portions of the instant amino 
acid sequences may be synthesized. These peptides may be used to immunise 
animals to produce polyclonal or monoclonal antibodies with specificity for 
peptides and/or proteins including the amino acid sequences. These antibodies 
may be then used to screen cDNA expression libraries to isolate full-length cDNA 
1 0 clones of interest. 

A genotype is the genetic constitution of an individual or group. Variations in 
genotype are important in commercial breeding programs, in determining 
parentage, in diagnostics and fingerprinting, and the like. Genotypes can be 
readily described in terms of genetic markers. A genetic marker ident'rfies a 

15 specific region or locus in the genome. The more genetic markers, the finer 
defined Is the genotype. A genetic marker becomes particulariy useful when it is 
allelic between organisms because it then may serve to unambiguously identify an 
individual. Furthermore, a genetic marker becomes particulariy useful when it is 
based on nucleic acid sequence information that can unambiguously establish a 

20 genotype of an individual and when the function encoded by such nucleic acid is 
known and is associated with a specific trait. Such nucleic acids and/or nucleotide 
sequence infonnation including single nucleotide polymorphisms (SNPs), 
variations in single nucleotides between allelic forms of such nucleotide sequence, 
may be used as perfect markers or candidate genes for the given trait. 

25 Applicants have identified a number of SNPs of the nucleic acids or nucleic 

acid fragments of the present invention. These are Indicated (marked with grey on 
the black background) in the figures that show multiple alignments of nucleotide 
sequences of nucleic acid fragments contributing to consensus contig sequences. 
See for example, Figures 3, 6, 9, 13, 16, 30. 37, 57, 60, 63. 79, 89, 92 and 104 

30 hereto. 

Accordingly, in a further aspect of the present invention, there is provided a 
substantially purified or isolated nucleic acid or nucleic acid fragment including a 
single nucleotide polymorphism (SNP) from a nucleic acid or nucleic acid fragment 
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acoording to the present invention, for example a SNP from a nucleic acid 
sequence shown In Figures 3, 6, 9, 13. 16, 30, 37. 57. 60. 63, 66. 67. 72. 78. 88. 
94, 101 and 104 hereto; or complements or sequences antisense thereto, and 
functionally active fragments and variants thereof. The invention further provides a 
5 substantially purified or isolated nucleic acid or nucleic acid fragment including a 
single nucleotide polymorphism (SNP) isolated by the method of this invention. 

In a still further aspect of the present invention there is provided a method 
of isolating a nucleic acid or nucleic acid fragment of the present invention 
including a SNP. said method including sequencing nucleic acid fragments from a 
10 nucleic acid library. The method includes the step of identifying the SNP. 

The nucleic acid library may be of any suitable type and is preferably a 
cDNA library. 

The nucleic acid or nucleic acid fragment may be isolated from a 
recombinant plasmid or may be amplified, for example using polymerase chain 
15 reaction. 

The sequencing may be performed by techniques known to those skilled in 
the art. 

In a still further aspect of the present invention, there is provided use of the 
nucleic acids or nucleic acid fragments of the present invention including SNPs, 
20 and/or nucleotide sequence information thereof, as molecular genetic markers. 

In a still further aspect of the present invention there is provided use of a 
nucleic acid or nucleic acid fragment of the present invention, and/or nucleotide 
sequence information thereof, as a molecular genetic marker. 

More particularly, nucleic acids or nucleic acid fragments according to the 
25 present invention and/or nucleotide sequence infonmation thereof may be used as 
a molecular genetic marker for quantitative trait loci (QTL) tagging, QTL mapping, 
DNA fingerprinting and In mariner assisted selection, particulariy in clovers, alfalfa, 
ryegrasses and fescues. Even more particulariy, nucleic acids or nucleic acid 
fragments according to the present invention and/or nucleotide sequence 
30 infomnation thereof may be used as molecular genetic markers in plant 
improvement in relation to plant tolerance to abiotic stresses such aluminium toxic 
acid soils; in relation to nutrient acquisition efficiency including phosphorus; in 
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relation to nitrogen fixation; in relation to nodulation. Even more particularly, 
sequence infonmation revealing SNPs in allelic variants of the nucleic acids or 
nucleic acid fragments of the present invention and/or nucleotide sequence 
information thereof may be used as molecular genetic markers for QTL tagging 
5 and mapping and in marker assisted selection, particularly in clovers, alfalfa, 
ryegrasses and fescues. 

In a still further aspect of the present invention there is provided a construct 
or vector including a nucleic acid or nucleic acid fragment according to the present 
invention. 

10 In a particularly preferred embodiment the construct or vector may include 

nucleic acids or nucleic acid fragments encoding both CS or CS-like and MDH or 
MDH-like polypeptides. 

In yet another preferred embodiment the construct or vector may include 
nucleic acids or nucleic acid fragments encoding both MDH or MDH-like and 
1 5 PEPC or PEPC-like polypeptides. 

In yet another preferred embodiment the construct or vector may include 
both CS or CS-like and PEPC or PEPC-like polypeptides. 

In another preferred embodiment the construct or vector may include 
nucleic acids or nucleic acid fragments encoding all three of CS or CS-like, MDH 
20 or MDH-like and PEPC or PEPC-like polypeptides. 

In a preferred embodiment of this aspect of the invention, the vector may 
include one or more regulatory element such as a promoter, one or more nucleic 
acids or nucleic acid fragments according to the present invention and one or 
more terminators; said one or more regulatory elements, one or more nucleic 
25 acids or nucleic acid fragments and one or more terminators being operatively 
linked. 

In a preferred embodiment of the present invention the vector may contain 
nucleic acids or nucleic acid fragments encoding both CS or CS-like and MDH or 
MDH-like polypeptides, operatively linked to a regulatory element or regulatory 
30 elements, such that both CS or CS-like and MDH or MDH-like polypeptides are 
expressed. 
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In another prefen-ed embodiment of the present invention the vector may 
contain nucleic acids or nucleic acid fragments encoding both CS or CS-like and 
PEPC or PEPC-like polypeptides, operatively linked to a regulatory element or 
regulatory elements, such that both CS or CS-like and PEPC or PEPC-like 
5 polypeptides are expressed. 

in yet another particularly preferred embodiment of the present invention 
the vector or construct may contain nucleic acids or nucleic acid fragments 
encoding both IVIDH or MDH-like and PEPC or PEPC-like polypeptides, 
operatively linked to a regulatory element or regulatory elements, such that both 
1 0 MDH or MDH-like and PEPC or PEPC-like polypeptides are expressed. 

In another particularly preferred embodiment of the present Invention the 
vector may contain nucleic acids or nucleic acid fragments encoding all three of 
CS or CS-like, MDH or MDH-like and PEPC or PEPC-like, operatively linked to a 
regulatory element or regulatory elements, such that ail three of CS or CS-like, 
15 MDH or MDH-like and PEPC or PEPC-like polypeptides are expressed. 

The vector may be of any suitable type and may be viral or non-viral. The 
vector may be an expression vector. Such vectors include chromosomal, non- 
chromosomal and synthetic nucleic acid sequences, eg. derivatives of plant 
viruses; bacterial plasmids; derivatives of the Tl plasmid firom Agrobacterium 
20 tumefaciens, derivatives of the Ri plasmid from Agrobacterium rtiizogenes; phage 
DNA; yeast artificial chromosomes; bacterial artificial chromosomes; binary 
bacterial artificial chromosomes; vectors derived from combinations of plasmids 
and phage DNA. However, any other vector may be used as long as it Is 
replicable, integrative or viable in the plant cell. 

25 The regulatory element and temninator may be of any suitable type and may 

be endogenous to the target plant cell or may be exogenous, provided that they 
are functional in the target plant cell. 

Preferably the regulatory element is a promoter. A variety of promoters 
which may be employed in the vectors of the present invention are well known to 
30 those skilled In the art. Factors influencing the choice of promoter include the 
desired tissue specificity of the vector, and whether constitutive or indudble 
expression Is desired and the nature of the plant cell to be transformed (eg. 
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monocotyledon or dicotyledon). Partlculariy suitable constitutive promoters include 
the Cauliflower Mosaic Virus 35S (CaMV 35S) promoter, the maize Ubiquitin 
promoter, and the rice Actin promoter. Particularly suitable tissue-specific 
promoters include root-prevalent promoters. 

5 A variety of temninators which may be employed in the vectors of the 

present Invention are also well known to those skilled in the art. The terminator 
may be from the same gene as the promoter sequence or a different gene. 
Particularly suitable temiinators are poiyadenylation signals, such as the CaMV 
35S polyA and other temninators from the nopaline synthase (nos) and the 
10 octopine synthase (pes) genes. 

The vector, in addition to the regulatory element, the nucleic acid or nucleic 
acid fragment of the present Invention and the terminator, may include further 
elements necessary for expression of the nucleic acid or nucleic acid fragment, in 
different combinations, for example vector backbone, origin of replication (ori), 

15 multiple cloning sites, spacer sequences, enhancers, introns (such as the maize 
Ubiquitin Ubi intron), antibiotic resistance genes and other selectable marker 
genes [such as the neomycin phosphotransferase {npt2) gene, the hygromycin 
phosphotransferase (hph) gene, the phosphinothricin acetyltransferase (bar or pat) 
gene, the phospho-mannose isomerase (pmi) gene], and reporter genes (such as 

20 beta-glucuronldase (GUS) gene (gusA)]. The vector may also contain a ribosome 
binding site for translation initiation. The vector may also Include appropriate 
sequences for amplifying expression. 

As an alternative to use of a selectable marker gene to provide a 
phenotypic trait for selection of transfonned host cells, the presence of the vector 
25 In transfonned cells may be detennlned by other techniques well known in the art, 
such as PCR (polymerase chain reaction), Southern blot hybridisation analysis, 
histochemical GUS assays, northem and Western blot hybridisation analyses. 

Those skilled in the art will appreciate that the various components of the 
vector are operatively linked, so as to result in expression of said nucleic acid or 
30 nucleic acid fragment. Techniques for operatively linking the components of the 
vector of the present invention are well known to those skilled in the art. Such 
techniques include the use of linkers, such as synthetic linkers, for example 
including one or more restriction enzyme sites. 
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The vectors of the present invention may be incorporated into a variety of 
plants, including monocotyledons (such as grasses from the genera Lolium, 
Festuca, Paspalum, Pennisetum, Panicum and other forage and turfgrasses, corn, 
oat, sugarcane, wheat and bariey), dicotyledons (such as Arabidopsis, tobacco, 
5 clovers, medics, eucalyptus, potato, sugarbeet, canola, soybean, chickpea) and 
gymnosperms. In a preferred embodiment, the vectors may be used to transform 
monocotyledons, preferably grass species such as ryegrasses {Lolium species) 
and fescues (Festuca species), more preferably perennial ryegrass, including 
forage- and turf-type cultivars. In an alternate preferred embodiment, the vectors 
10 may be used to transform dicotyledons, preferably forage legume species such as 
clovers {Trifolium species) and medics {Medicago species), more preferably white 
clover {Trifolium repens), red clover (Trifolium pratense), subterranean clover 
(Trifolium subterraneum) and alfalfa (Medicago sativa). Clovers, alfalfa and medics 
are key pasture legumes in temperate climates throughout the wortd. 

15 Techniques for incorporating the vectors of the present invention into plant 

cells (for example by transduction, transfection or transformation) are known to 
those skilled in the art. Such techniques include Agrobacterium mediated 
introduction, electroporation to tissues, cells and protoplasts, protoplast fusion, 
Injection into reproductive organs, injection Into Immature embryos and high 

20 velocity projectile Introduction to cells, tissues, calli, Immature and mature 
embryos. The choice of technique will depend largely on the type of plant to be 
transformed. 

Cells incorporating the vectors of the present invention may be selected, as 
described above, and then cultured in an appropriate medium to regenerate 
25 transformed plants, using techniques well known in the art. The culture conditions, 
such as temperature, pH and the like, will be apparent to the person skilled in the 
art. The resulting plants may be reproduced, either sexually or asexually, using 
methods well known in the art, to produce successive generations of transformed 
plants. 

30 In a further aspect of the present Invention there is provided a plant cell, 

plant, plant seed or other plant part, including, e.g. transformed with, a vector, 
nucleic acid or nucleic acid fragment of the present invention. 
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The plant cell, plant, plant seed or other plant part may be from any suitable 
species, including monocotyledons, dicotyledons and gymnospenns. In a 
prefen-ed embodiment the plant cell, plant, plant seed or other plant part may be 
from a monocotyledon, preferably a grass species, more preferably a. ryegrass 
5 {Lolium species) or fescue (Festuca species), more preferably perennial ryegrass, 
including both forage- and turf-type cultivars. In an alternate preferred embodiment 
the plant cell, plant, plant seed or other plant part may be from a dicotyledon, 
preferably forage legume species such as clovers {Trifolium species) and medics 
{Medicago species), more preferably white clover {Trifolium repens), red clover 
10 {Trifolium pratense), subterranean clover {Trifolium subterraneum) and alfalfa 
{Medicago sativa). 

The present invention also provides a plant, plant seed or other plant part, 
or a plant extract derived from a plant cell of the present invention. 

The present invention also provides a plant, plant seed or other plant part, 
15 or a plant extract derived from a plant of the present invention. 

In a further aspect of the present invention there is provided a method of 
modifying organic acid biosynthesis; of modifying organic acid secretion; of 
modifying phosphorous and other nutrients acquisition efficiency in plants; of 
modifying aluminium and acid soil tolerance In plants; of modifying nitrogen 
20 fixation and nodule function, said method including introducing into said plant an 
effective amount of a nucleic acid or nucleic acid fragment according to the 
present invention. Preferably the nucleic acid or nucleic acid fragment is part of a 
vector. 

Using the methods and products of the present invention, organic acid 
25 biosynthesis; organic acid secretion; phosphorous and other plant nutrient 
acquisition efficiency; aluminium and acid soil tolerance; nitrogen fixation and 
nodule function, may be Increased or otherwise altered, for example by 
incorporating additional copies of a sense nucleic acid or nucleic acid fragment of 
the present invention. They may be decreased or othenwise altered, for example 
30 by incorporating an antisense nucleic acid or nucleic acid fragment of the present 
invention. 
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In a particulariy preferred embodiment the method may include introducing 
into said plant nucleic acids or nucleic acid fragments encoding both CS or CS-like 
and MDH or MDH-like polypeptides. 

In another preferred embodiment the method may include introducing into 
5 said plant nucleic acids or nucleic acid fragments encoding both CS or CS-lil<e and 
PEPC or PEPC polypeptides. 

In yet another preferred embodiment the method may include introducing 
into said plant nucleic acids or nucleic acid fragments encoding both MDH or 
MDH-like and PEPC or PEPC-like polypeptides. 

10 In an even more preferred embodiment the method may include introducing 

into said plant nucleic acids or nucleic acid fragments encoding all three of CS or 
CS-like, MDH or MDH-like and PEPC or PEPC-like polypeptides. 

The present invention will now be more fully described with reference to the 
accompanying Examples and drawings. It should be understood, however, that the 
15 description following is illustrative only and should not be taken in any way as a 
restriction on the generality of the invention described above. 

In the Figures 

Figure 1 shows the consensus contig nucleotide sequence of LpCSa. 

Figure 2 shows the deduced amino acid sequence of LpCSa. 

20 Figure 3 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence LpCSa. 

Figure 4 shows the consensus contig nucleotide sequence of LpCSb. 

Figure 5 shows the deduced amino acid sequence of LpCSb. 

Figure 6 shows the nucleotide sequences of the nucleic acid fragments 
25 contributing to the consensus contig sequence LpCSb. 

Figure 7 shows the nucleotide sequence of LpCSc. 

Figure 8 shows the deduced amino acid sequence of LpCSc. 

Figure 9 shows the nucleotide sequence of LpCSd. 

Figure 10 shows the deduced amino acid sequence of LpCSd. 
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Figure 1 1 shows the consensus contig nucleotide sequence of Lpl\^DHa. 

Figure 12 shows the deduced amino acid sequence of LpMDIHa. 

Figure 13 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHa. 

5 Figure 14 shows the consensus contig nucleotide sequence of LpMDHb. 

Figure 15 shows the deduced amino acid sequence of LpMDHb. 

Figure 16 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHb. 

Figure 17 shows the nucleotide sequence of LpMDHc. 
10 Figure 18 shows the deduced amino acid sequence of LpMDHc. 

Figure 19 shows the nucleotide sequence of LpMDHd. 

Figure 20 shows the deduced amino acid sequence of LpMDHd. 

Figure 21 shows the nucleotide sequence of LpMDHe. 

Figure 22 shows the deduced amino acid sequence of LpMDHe. 
15 Figure 23 shows the consensus contig nucleotide sequence of LpMDHf. 

Figure 24 shows the deduced amino acid sequence of LpMDHf. 

Figure 25 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHf. 

Figure 26 shows the nucleotide sequence of LpMDHg. 

20 Figure 27 shows the deduced amino acid sequence of LpMDHg. 

Figure 28 shows the consensus contig nucleotide sequence of LpMDHh. 

Figure 29 shows the deduced amino acid sequence of LpMDHh. 

Figure 30 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHh. 

25 Figure 31 shows the nucleotide sequence of LpMDHi. 

Figure 32 shows the deduced amino acid sequence of LpMDHi. 

Figure 33 shows the nucleotide sequence of LpMDHj. 
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Figure 34 shows the deduced amino acid sequence of LpMDHj. 

Figure 35 shows the consensus contig nucleotide sequence of LpMDHk. 

Figure 36 shows the deduced amino acid sequence of LplViDHi<. 

Figure 37 shows the nucleotide sequences of the nucleic acid fragments 
5 contributing to the consensus contig sequence LpMDHk. 

Figure 38 shows the nucleotide sequence of LpMDHI. 

Figure 39 shows the deduced amino acid sequence of LpMDHI. 

Figure 40 shows the nucleotide sequence of LpMDHm. 

Figure 41 shows the deduced amino acid sequence of LpMDHm. 

10 Figure 42 shows the nucleotide sequence of LpPEPCa. 

Figure 43 shows the deduced amino acid sequence of LpPEPCa. 

Figure 44 shows the consensus contig nucleotide sequence of LpPEPCb. 

Figure 45 shows the deduced amino acid sequence of LpPEPCb. 

Figure 46 shows the nucleotide sequences of the nucleic acid fragments 
15 contributing to the consensus contig sequence LpPEPCb. 

Figure 47 shows the nucleotide sequence of LpPEPCc. 

Figure 48 shows the deduced amino acid sequence of LpPEPCc. 

Figure 49 shows the nucleotide sequence of LpPEPCd. 

Figure 50 shows the deduced amino acid sequence of LpPEPCd. 
20 Figure 51 shows the nucleotide sequence of LpPEPCe. 

Figure 52 shows the deduced amino acid sequence of LpPEPCe. 

Figure 53 shows the nucleotide sequence of LpPEPCf. 

Figure 54 shows the deduced amino acid sequence of LpPEPCf. 

Figure 55 shows the consensus contig nucleotide sequence of TrMDHa. 

25 Figure 56 shows the deduced amino acid sequence of TrMDHa. 

Figure 57 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHa. 
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Figure 58 shows the consensus contig nucleotide sequence of TrMDHb. 

Figure 59 shows the deduced amino acid sequence of TrMDHb. 

Figure 60 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHb. 

5 Figure 61 shows the consensus contig nucleotide sequence of TrMDHc. 

Figure 62 shows the deduced amino acid sequence of TrMDHc. 

Figure 63 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHc. 

Figure 64 shows the consensus contig nucleotide sequence of TrMDHd. 

10 Figure 65 shows the deduced amino acid sequence of TrMDHd. 

Figure 66 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHd. 

Figure 67 shows the consensus contig nucleotide sequence of TrMDHe. 

Figure 68 shows the deduced amino acid sequence of TrMDHe. 

15 Figure 69 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrMDHe. 

Figure 70 shows the consensus contig nucleotide sequence of TrMDHf. 

Figure 71 shows the deduced amino acid sequence of TrMDHf. 

Figure 72 shows the nucleotide sequences of the nucleic acid fragments 
20 contributing to the consensus contig sequence TrMDHf. 

Figure 73 shows the consensus contig nucleotide sequence of TrMDHg. 

Figure 74 shows the deduced amino acid sequence of TrMDHg. 

Figure 75 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHg. 

25 Figure 76 shows the consensus contig nucleotide sequence of TrMDHh. 

Figure 77 shows the deduced amino acid sequence of TrMDHh. 

Figure 78 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHh. 
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Figure 79 shows the consensus contig nucleotide sequence of TrIVIDHi. 

Figure 80 shows the deduced amino acid sequence of TrMDHi. 

Figure 81 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHi. 

5 Figure 82 shows the nucleotide sequence of TrMDHj. 

Figure 83 shows the deduced amino acid sequence of TrMDHj. 

Figure 84 shows the nucleotide sequence of TrMDHlc. 

Figure 85 shows the deduced amino acid sequence of TrMDHk. 

Figure 86 shows the consensus contig nucleotide sequence of TrPEPCa. 

10 Figure 87 shows the deduced amino acid sequence of TrPEPCa. 

Figure 88 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrPEPCa. 

Figure 89 shows the consensus contig nucleotide sequence of TrPEPCb. 

Figure 90 shows the deduced amino acid sequence of TrPEPCb. 

15 Figure 91 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrPEPCb. 

Figure 92 shows the consensus contig nucleotide sequence of TrPEPCc. 

Figure 93 shows the deduced amino acid sequence of TrPEPCc. 

Figure 94 shows the nucleotide sequences of the nucleic acid fragments 
20 contributing to the consensus contig sequence TrPEPCc. 

Figure 95 shows the nucleotide sequence of TrPEPCd. 

Figure 96 shows the deduced amino acid sequence of TrPEPCd. 

Figure 97 shows the nucleotide sequence of TrPEPCe. 

Figure 98 shows the deduced amino acid sequence of TrPEPCe. 

25 Figure 99 shows the consensus contig nucleotide sequence of TrCSa. 

Figure 100 shows the deduced amino acid sequence of TrCSa. 
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Figure 101 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrCSa. 

Figure 102 shows the consensus contig nucleotide sequence of TrCSb. 

Figure 103 shows the deduced amino acid sequence of TrCSb. 

5 Figure 104 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrCSb. 

Figure 105 shows the plasmid map in pGEM-T Easy of TrMDH. 

Figure 1 06 shows the nucleotide sequence of TrMDH. 

Figure 107 shows the deduced amino acid sequence of TrMDH. 

10 Figure 108 shows the plasmid map of sense construct of TrMDH in the binary 
vector pPZP221:35S2. 

Figure 109 shows the plasmid map in pGEM-T Easy of TrPEPC. 

Figure 110 shows the nucleotide sequence of TrPEPC. 

Figure 111 shows the deduced amino acid sequence of TrPEPC. 

15 Figure 112 shows the plasmid map of sense construct of TrPEPC in the binary 
vector pPZP221:35S^ 

Figure 113 shows the plasmid map in pGEM-T Easy of TrCSa. 

Figure 1 14 shows the nucleotide sequence of TrCSa. 

Figure 115 shows the deduced amino acid sequence of TrCSa. 

20 Figure 116 shows the plasmid map of sense construct of TrCSa in the binary 
vector pPZP221:35S2. 

Figure 117 shows the plasmid map in pGEM-T Easy of TrCSb. 

Figure 118 shows the nucleotide sequence of TrCSb. 

Figure 119 shows the deduced amino acid sequence of TrCSb. 

25 Figure 120 shows the plasmid map of sense construct of TrCSb in the binary 
vector pPZP221 :35S^. 

Figure 121 shows the plasmid map in pGEM-T Easy of TrCSd. 
Figure 122 shows the nucleotide sequence of TrCSd. 
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Figure 123 shows the deduced amino acid sequence of TrCSd. 

Figure 124 shows the plasmid map of sense construct of TrCSd in the binary 
vector pPZP221:35S^ 

Figure 125 shows the plasmid maps of the modular vector system comprising a 
5 binary base vector and 7 auxiliary vectors. 

Figure 126 shows an example of the modular binary transformation vector system 
comprising plasmid maps of the binary transformation vector backbone and 4 
expression cassettes for combinatorial expression of chimeric CS and MDH and 
PEPC genes in auxiliary vectors (A) and the plasmid map of the T-DNA region of 
1 0 the final binary transformation vector (B). 

Figure 127 shows the results of RT-PCR experiments performed as described in 
Example 6. Samples were isolated from: L, leaf; S, stolon; St, stolon tip; R, root; 
Rt, root tip. -C: negative (no reverse transcriptase) control; +C, positive (plasmid) 
control. The numbers indicate cycle numbers. A: phosphate transporter homolog; 
15 B: root iron transporter homolog. 

Figure 128 shows the screening of a white glover BAG library using the phosphate 
transporter cDNA as a probe (A); Southern hybridisation blot of six BAG clones 
identified in A using the same probe (B); physical map of the phosphate 
transporter genomic region including the coding region and the promoter region 
20 (G). 

Figure 129 shows white clover cotyledons, various stages of selection of plantlets 
transformed with a binary transformation vector constructed as described in 
Examples 4 and 5, transgenic white clover on root-inducing medium, and white 
clover plants transformed with genes involved in organic acid biosynthesis. 

25 Figure 130 shows the molecular analysis of transgenic white clover plants for the 
presence of the chimeric MDH gene with real time PGR amplification plot and 
agarose gel of PGR product. 

Figure 131 shows the molecular analysis of transgenic white clover plants for the 
presence of the chimeric PEPG gene with real time PGR amplification plot and 
30 agarose gel of PGR product. 
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Figure 132 shows the molecular analysis of transgenic white clover plants for the 
presence of the chimeric CS gene with real time PGR amplification plot and 
agarose gel of PGR product 

EXAMPLE 1 

5 Preparation of cDNA libraries, Isolation and sequencing of cDNAs coding for 
CS, CS-like, MDH, IMDH-like, PEPC and PEPC-like polypeptides from white 
clover {Trifollum repen^ and perennial ryegrass {JLollum perenn^ 

cDNA libraries representing mRNAs from various organs and tissues of 

white dover {TrifoUum repens) and perennial ryegrass (Lolium perenne) were 

10 prepared. The characteristics of the white clover and perennial ryegrass libraries, 

respectively, are described below (Tables 1 and 2). 



TABLE 1 

cDNA libraries from white clover (Trifo/ium repens) 



Library 


Organ/Tissue 


01 wc 


Whole seedling, light grown 


02wc 


Nodulated root 3, 5, 10. 14, 21 &28 day old seedling 


03wc 


Nodules pinched off roots of 42 day old riiizobium inoculated white 
clover 


04wc 


Nodulated white clover cut leaf and stem collected after 0, 1, 4, 6 &14 
h after cutting 


05wc 


Non-nodulated Inflorescences: <50% open, not fully open and fully 
open 


06wc 


Dark grown etiolated 


07wc 


Inflorescence - very eariy stages, stem elongation, < 15 petals, 15-20 
petals 


08wc 


seed frozen at-80°C, imbibed in dark overnight at 10°C 


09wc 


Drought stressed plants 


lOwc 


AMV infected leaf 


11wc 


WGMV infected leaf 
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Library 


Organ/Tissue 


12wc 


Phosphorus starved plants 


13wc 


Vegetative stolon tip 


14wc 


stolon root initials 


15wc 


Senescing stolon 


16wc 


Senescing leaf 


TABLE 2 

cDNA libraries from perennial ryegrass {Lolium perenn^ 


Library 


Organ/Tissue 


01rg 


Roots from 3-4 day old light-grown seedlings 


02rg 


Leaves from 3-4 day old light-grown seedlings 


03rg 


Etiolated 3-4 day old dark-grown seedlings 


U4rg 


vvnoie eiioioieu seeaiinyo \ i-o uay otu anu 1 1 uays oiu^ 


05rg 


Senescing leaves from mature plants 


06rg 


Whole etiolated seedlings (1-5 day old and 17 days old) 


07rg 


Roots from mature plants grown in hydroponic culture 


08rg 


Senescent leaf tissue 


09rg 


Whole tillers and sliced leaves (0. 1, 3, 6, 12 and 24 h after 
harvesting) 


10rg 


Embryogenic suspension-cultured cells 


11rg 


Non-embryogenic suspension-cultured cells 


12rg 


Whole tillers and sliced leaves (0, 1. 3, 6, 12 and 24 h after 
harvesting) 


13rg 


Shoot apices including vegetative apical meristems 


14rg 


Immature inflorescences including different stages of inflorescence 
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Library 


Organ/Tissue 




meristem and inflorescence development 


15rg 


Defatted pollen 


16rg 


LeaT Diaoes ana lear sneaxns ^rocL, rDco, cao, winc/\ suDiracieu^ 


17rg 


Senescing leaves and tillers 


18rg 


Drought-stressed tillers (pseudostems from plants subjected to PEG- 
simulated drought stress) 


19rg 


Non-embryogenic suspension-cultured cells subjected to osmotic 
stress (grown in media with half-strength salts) 

(1 , 2, 3, 4, 5, D, 24 and 48 n atter transfer) 


20rg 


Non-embryogenic suspension-cultured cells subjected to osmotic 
stress (grown in media with double-strength salts) 

(1 , Zj o, 4, o, o, ^4 ana 4o n aner iransi er; 


z irg 


simulated drought stress) 


22rg 


Spikelets with open and maturing florets 


23rg 


Mature roots (specific subtraction with leaf tissue) 



The cDNA libraries may be prepared by any of many methods available. 
For example, total RNA may be isolated using the Trizol method (GIboo-BRL, 
USA) or the RNeasy Plant Mini kit (Qiagen, Germany), following the 
5 manufacturers' instructions. cDNAs may be generated using the SMART PGR 
cDNA synthesis kit (Clontech, USA), cDNAs may be amplified by long distance 
polymerase chain reaction using the Advantage 2 PGR Enzyme system (Glontech, 
USA), cDNAs may be cleaned using the GeneGlean spin column (Bio 101, USA), 
tailed and size fractionated, according to the protocol provided by Clontech. The 
10 cDNAs may be introduced into the pGEM-T Easy Vector system 1 (Promega. 
USA) according to the protocol provided by Promega. The cDNAs in the pGEM-T 
Easy plasmid vector are transfected into Escherichia coU Epiojrian coll XLIO-Gold 
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ultra competent cells (Stratagene, USA) according to the protocol provided by 
Stratagene. 

Alternatively, the cDNAs may be introduced into plasmid vectors for first 
preparing the cDNA libraries in Uni-ZAP XR vectors according to the 
5 manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA, USA). The 
Uni-ZAP XR libraries are converted into plasmid libraries according to the protocol 
provided by Stratagene. Upon conversion, cDNA inserts will be contained in the 
plasmid vector pBluescript. In addition, the cDNAs may be introduced directly into 
precut pBluescript II SK(+) vectors (Stratagene) using T4 DNA ligase (New 
10 England Biolabs). followed by transf action into E. coli DH10B cells according to 
the manufacturer's protocol (GIBCO BRL Products). 

Once the cDNA inserts are in plasmid vectors, plasmid DNAs are prepared 
from randomly picked bacterial colonies containing recombinant plasmids, or the 
insert cDNA sequences are amplified via polymerase chain reaction using primers 

15 specific for vector sequences flanking the inserted cDNA sequences. Plasmid 
DNA preparation may be performed robotically using the Qiagen QiaPrep Turbo kit 
(QIagen, Gemnany) according to the protocol provided by Qiagen. Amplified insert 
DNAs are sequenced in dye-terminator sequencing reactions to generate partial 
cDNA sequences (expressed sequence tags or "ESTs"). The resulting ESTs are 

20 analysed using an Applied Biosystems ABI 3700 sequence analyser. 

EXAMPLE 2 
DNA sequence analyses 

The cDNA clones encoding OS, CS-like, MDH, MDH-like, PEPC and PEPC- 
like polypeptides were identified by conducting BLAST (Basic Local Alignment 

25 Search Tool; Altschul et al. (1993) J. MoL Biol. 215:403-410) searches. The cDNA 
sequences obtained were analysed for similarity to all publicly available DNA 
sequences contained in the eBloinformatics nucleotide database using the 
BLASTn algorithm provided by the National Center for Biotechnology Infomnatlon 
(NCBI). The DNA sequences were translated in all reading frames and compared 

30 for similarity to all publicly available protein sequences contained in the SWISS- 
PROT protein sequence database using BLASTx algorithm (v 2.0.1) (Gish and 
States (1993) Nature Genetics 3:266-272) provided by the NCBI. 
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The cDNA sequences obtained and identified were then used to identify 
additional identical and/or overlapping cDNA sequences generated using the 
BLASTn algorithm. The identical and/or overiapping sequences were subjected to 
a multiple alignment using the CLUSTALw algorithm, and to generate a consensus 
5 contig sequence derived from this multiple sequence alignment. The consensus 
contig sequence was then used as a query for a search against the SWISS-PROT 
protein sequence database using the BLASTx algorithm to confirm the initial 
identification. 

EXAMPLE 3 

10 Identification and fuli-iength sequencing of cDNAs encoding CS, MDI-i and 

PEPC polypeptides 

To fully characterise for the purposes of the generation of probes for 
hybridisation experiments and the generation of transformation vectors, a set of 
cDNAs encoding white clover CS, MDH and PEPC polypeptides was Identified 
1 5 and fully sequenced. 

Full-length cDNAs were identified from our EST sequence database using 
relevant published sequences (NCBI databank) as queries for BLAST searches. 
Full-length cDNAs were identified by alignment of the query and hit sequences 
using Sequencher (Gene Codes Corp., Ann Arbor, Ml 48108, USA). The original 

20 plasmid was then used to transform chemically competent XL-1 cells (prepared In- 
house, CaCb protocol). After colony PGR (using HotStarTaq. Qiagen) a minimum 
of three PCR-positive colonies per transformation were picked for initial 
sequencing with M13F and M13R primers. The resulting sequences were aligned 
with the original EST sequence using Sequencher to confirm identity and one of 

25 the three clones was picked for full-length sequencing, usually the one with the 
best initial sequencing result. 

Sequencing of all cDNAs was completed by primer walking, i.e. 
oligonucleotide primers were designed to the Initial sequence obtained using 
M13F and M13R oligonucleotide primers and used for further sequencing. The 
30 sequences of the oligonucleotide primers are shown in Table 2. 

Contigs were then assembled in Sequencher. The contigs Include the 
sequences of the SMART primers used to generate the initial cDNA library as well 
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as pGEM-T Easy vector sequence up to the EcoRI cut site both at the 5' and 3' 
end. 

Plasmid maps and the full cDNA sequences of TrCSa, TrCSb, TrCSd, 
TrMDH and TrPEPC polypeptides were obtained (Figures 1, 2, 5, 6, 9, 10, 13, 14, 
5 17,18.21, 22. 25. 26, 29 and 30). 



TABLE 3 

List of primers used for sequencing of the fuli-length cDNAs encoding CS, 

MDH and PEPC 



gene name 


clone ID 


sequencing primer 


primer sequence (5'>3*) 


TrCSa 


05wc1HsB08 


05wc1HsB08.f1 


TTGCCCGAGGCTATACTGTGGC 






05wc1HsB08.f2 


CAGCTCACCTAGTTGCTAG 






05wc1HsB08.f3 


CCATGGCCTAATGTTGATGC 






05wc1HsB08.r1 


TTGGCCTTTCAAGTGGCATTCC 






05wc1HsB08.r2 


CAGAATGGGAGGCACGACTTC 






05wc1HsB08.r3 


ATGTGAGCATAGTTTGCACC 


TrCSb 


05wc2HsD09 


05wc2HsD09.f1 


GACTGCCAGAAAACACTTCCAGG 






05wc2HsD09.f2 


ATGACTGCTTTAGTGTGG 






05wc2HsD09.r1 


CTCAAGTTTCTCCAGTGTGACAC • 






05wc2HsD09.r2 


TGACTTATGTATCCCACC 






05wc2HsD09.r3 


GCTCTGAATGGTTTAGCTGG 


TrCSd 


10wc1BsF10 


10wc1BsF10.f1 


GCACTGCCTGTTTCTGCTCT^TCC 






10wc1BsF10.f2 


AGCCAACTTATGAGGATAGC 






10wc1BsF10.r1 


CTCCAATACTCCTCGCGACGCC 






10wc1BsF10.r2 


AGGCACAACCTGGCCACTG 






10wGlBsF10.r3 


ACGTTGCCACCTTCATGATC 


TrMDH 


13wc1NsD01 


13wc1NsD01.f1 


GTTGTTATACCTGCTGGTGTT 






13wc1NsD01.r1 


CTCACTCAACCCTTGGAGAT 


TrPEPC 


15wc1DsH12 


15wc1DsH12.f1 


TCCTAAGAAACTTGAAGA6CTCGG 






15wc1DsH12.f2 


AGATGTTTGCTTACTAGC 






15wc1DsH12,r1 


GCCAGCAGCAATACCCTTCATGG 






15wGlDsH12.r2 


TTGCTTCTCAACTGTTCC 
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EXAMPLE 4 

Development of binary transformation vectors containing chimeric genes 
with cDNA sequences encoding CS, iMDH and PEPC 

5 To alter the expression of the polypeptides involved in organic acid 

biosynthesis to improve phosphorus acquisition efficiency as well as aluminium 
and acid soil tolerance in forage plants, a set of sense binary transformation 
vectors was produced. 

The pPZP221 binary transfonmatlon vector (Hajdul^lewicz et ah, 1994) was 
1 0 modified to contain the 35S^ cassette from pKYLX71 :35S^ (SchardI ef a/., 1 987) as 
follows: pKYLX71 :35S^ was cut with Clal. The 5' overhang was filled in using 
Klenow and the blunt end was A-tailed with Taq polymerase. After cutting with 
EcoRI, the 2kb fragment with an EcoRI-compatible and a 3'-A tail was gel-purified. 
pPZP221 was cut with Hindlll and the resulting 5' overhang filled in and T-tailed 
15 with Taq polymerase. The remainder of the original pPZP221 multi-cloning site 
was removed by digestion with EcoRI, and the expression cassette cloned into the 
EcoRi site and the 3' T overhang restoring the Hindlll site. This binary vector 
contains between the left and right border the plant selectable marker gene aacCI 
under the control of the 35S promoter and 35S temninator and the pKYLX71:35S^- 
20 derived expression cassette with a CaMV 35S promoter with a duplicated 
enhancer region and an rbcS terminator. 

A GATEWAY® cloning cassette (Invitrogen) was introduced into the 
multicloning site of the pPZP221:35S^ vector obtained as described following the 
manufacturer's protocol. 

25 cDNA fragments were generated by high fidelity PCR with a proofreading 

DNA polymerase using the original pGEM-T Easy plasmid cDNA as a template. 
The primers used (Table 3) contained af/B sequences for use with recombinases 
utilising the GATEWAY® system (Invitrogen). The resulting PCR fragments were 
used in a recombination reaction with pDONR® vector (Invitrogen) to generate 

30 entry vectors. In a further recombination reaction, the cDNAs encoding the open 
reading frame sequences were transferred from the entry vector to the 
GATEWAY®-enabled pPZP221 :35S^ vector. 
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The orientation of the constaicts (sense or antisense) was checked by 
restriction enzyme digest and sequencing which also confirmed the correctness of 
the sequence. Transformation vectore containing chimeric genes using full-length 
open reading frame cDNAs encoding white clover TrCSa, TrCSb, TrCSd, TrMDH 
5 and TrPEPC proteins In sense orientation under the control of the CaMV 35S^ 
promoter were generated (Figures 4, 8, 12, 16. 20. 24, 28 and 32). 
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TABLE 4 



List of primers used to PCR-ampiify tiie open reading frames of cDNAs 

encoding CS, MDH and PEPC 



gene name 


clone ID 


primer 


primer sequence (5'>3') 


1 rL/o3 


UOWG 1 nSDUO 


uowc 1 no DUO 1 


AGGCTTGATCTTAATGGCGTTCTT 
TCG 






nR\htf*^ MeRflAr 
v/owc 1 noDuor 


nnr^rtAPP AnTTrnT AP A AriA A Anr* 
TGGGTTTTCAATTTTAGGACGATG 
CG 


1 rooD 






nfMrJACAAnTTTftTAPAAAAAAnr* 

AGGCTTTGTTGATTGATCTTAATG 
GC 






uowc^ no u«7r 


ftiry^riA PP APTTTHT A P A An A A AHP 

TGGGTTAGTAATCCACAGATAACC 
G 


1 lOoQ 


i n^Afr^l Re PI H 
1 WWC 1 Doi 1 U 


1 Hwr^i RqPI Of 

1 UWw i Doi 1 KJt 


rmnnAP A AttTTTOT AP A A A A A AfiP 
AGGCTCTAGATTGTTGATTGATCT 
AAATGGC 






1 uwo 1 Dor 1 ur 


f^r^rU^ A PP A PTTTTiT A P A Art A A AnP 

TGGGTCTAGATTCAATTTTAGGAT 
GATGCACC 




13wc1 NsD01 


13wc1 NsDOIf 


GGGGACAAGTTTGTACAAAAAAGC 
AGGCTCTAGAAATTCCCATTACCA 
TTCATTCC 






13wc1NsD01r 


GGGGACCACTTTGTACAAGAAAGC 
TGGGTCTAGATTGACATTCTCTCG 
CATGGACGC 


TrPEPC 


15wc1DsH12 


15wc1DsH12f 


GGGGACAAGTTTGTACAAAAAAGC 
AGGCTTGAGAAGGAGTGAATTGCT 
CC 






15wc1DsH12r 


GGGGACCACTTTGTACAAGAAAGC 
TGGGTATGATATCTTAGCACACAC 
TTAAC 



5 EXAIMPLE 5 

Development of binary transformation vectors containing cliimeric genes 
with a combination of 2 or more cDNA sequences encoding CS, MDH and 

PEPC 

To alter tlie expression of the polypeptides Involved in organic acid 
10 biosynthesis to Improve phosphorus acquisition efficiency as well as aluminium 
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and acid soil tolerance In forage plants, a modular binary transformation vector 
system was used (Figure 125). The modular binary vector system enables 
simultaneous integration of up to seven transgenes the expression of which Is 
controlled by individual promoter and terminator sequences into the plant genome 
5 (Goderis et al., 2002). 

The modular binary vector system consists of a pPZP200-derived vector 
(Hajdukiewicz et al., 1994) backbone containing within the T-DNA a number of 
unique restriction sites recognised by homing endonucleases. The same 
restriction sites are present in pUC18-based auxiliary vectors flanking standard 

10 multicloning sites. Expression cassettes comprising a selectable marker gene 
sequence or a cDNA sequence to be introduced Into the plant under the control of 
regulatory sequences like promoter and terminator can be constructed in the 
auxiliary vectors and then transfenned to the binary vector backbone utilising the 
homing ehdonuclease restriction sites. Up to seven expression cassettes can thus 

15 be integrated into a single binary transfomriatlon vector. The system is highly 
flexible and allows for any combination of cDNA sequence to be introduced into 
the plant with any regulatory sequence. 

For example, a selectable marker cassette comprising the nos promoter 
and nos tenmlnator regulatory sequences controlling the expression of the nptll 
20 gene was PCR-amplified using a proofreading DNA polymerase from the binary 
vector pKYLX71:35S^ and directfonally cloned Into the Agel and NotI sites of the 
auxiliary vector pAUX3166. Equally, other selectable marker cassettes can be 
introduced into any of the auxiliary vectors. 

In another example, the expression cassette from the binary vector pWIVIS 
25 consisting of the ASSU promoter and the tob terminator was PCR-amplified with a 
proofreading DNA polymerase and directionally cloned into the Agel and NotI sites 
of the auxiliary vector pAUX3169. Equally, other expression cassettes can be 
introduced Into any of the auxiliary vectors. 

In yet another example, the expression cassette from the direct gene 
30 transfer vector pDH51 was cut using EcoRI and cloned directly into the EcoRI site 
of the auxiliary vector pAUX31 32. 
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TABLE 5 



List of primers used to PCR-amplHy plant selectable marker cassettes and 
the regulatory elements used to control the expression of CS, MDH and 

PEPC genes 



exDression 
cassette 


primer 


Drimer seauence (5'>3') 


nos::nptll-nos 


forward 


ATAATAACCGGTTGATCATGAGCGGAGAATTAAG 
GO 




reverse 


ATAATAGCGGCCGCTAGTAACATAGATGACACCG 
CG 


35S::aacC1-35S 


forward 


AATAGCGGCCGCGATTTAGTACTGGATTTTGG 




reverse 


AATAACCGGTACCCACGAAGGAGCATCGTGG 


35S^::rbcS 


forward 


ATAATAACCGGTGCCCGGGGATCTCCTTTGCC 




reverse 


ATAATAGCGGCCGCATGCATGTTGTCAATCAATT 
GG 


assu::tob 


forward 


TAATACCGGTAAATTTATTATGRGTTTTTTTCCG 




reverse 


TAATGCGGCCGCTAAGGGCAGCCCATACAAATGA 
AGC 



5 

The expression cassettes were further modified by introducing a 
GATEWAY® cloning cassette (Invitrogen) into the multicloning site of the 
respective pAUX vector following the manufacturer's protocol. In a recombination 
reaction, the cDNAs encoding the open reading frame sequences were transferred 
10 from the entry vector obtained as described in Example 4 to the GATEWAY®- 
enabled pAUX vector. Any combination of the regulatory elements with cDNA 
sequences of TrCSa, TrCSb, TrCSd, TrMDH and TrPEPC can be obtained. One 
typical example is given in Figure 126 with expression cassettes comprising the 
nptll plant selectable marker, TrPEPC, TrCSa and TrIVIDH. 

15 Complete expression cassettes comprising any combination of regulatory 

elements and cDNA sequences to be introduced into the plant were then cut from 
the auxiliary vectors using the respective homing endonuclease and cloned into 
the respective restriction site on the binary vector backbone. After verification of 
the construct by nucleotide sequencing, the binary transformation vector 
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comprising a number of expression cassettes was used to generate transgenic 
wliite clover plants. 

EXAMPLE 6 

Isolation of regulatory elements to direct expression of chimeric genes 
5 encoding CS, MDH and PEPC involved in organic acid biosynthesis 

To direct the expression of chimeric white clover genes TrCSa, TrCSb, 
TrCSd, TrMDH and TrPEPC involved in organic acid biosynthesis to specific 
tissues, regulatory elements showing specificity for expression in root or root tip 
tissue were Identified and isolated. 

10 Using the BLASTn algorithm, white clover EST sequence collections 

prepared as detailed in Examples 1 and 2 were searched with nucleotide 
sequences representing genes with known root-specific expression identified in 
GenBank as queries. Suitable candidate ESTs were identified and oligonucleotide 
primers for reverse transcription-PCR (RT-PCR) were designed (see Table 4). 

15 TABLES 



Oligonucleotide primers used in reverse transcription-PCR to confirm 
tissue specificity of candidate white clover ESTs 



gene 


forward primer (5'->3') 


reverse primer (5'->3') 


histone (intemal 
control) 


CCGATTCCGTTTCAATGGCTCGTA 


GCCATCCTTAACCCTAAGCACGT 


white clover phosphate 
transporter homolog 


TTGCATTTGCTTGGAACAACTAG 


GCAAGAGCAAACATGAAACCA 


white clover root iron 
transporter homolog 


ATGGGTCTTGGTGGTTGCA 


GCAGCAAGAAGATCAACCAAAGCCA 



Total RNA for RT-PCR experiments was isolated from a leaf, stolon, stolon 
20 tip, root and root tip of white clover plants grown in the glasshouse using the 
TRIZOL method. Reverse transcription was performed using SuperScriptll 
(Invitrogen) following the supplier's instructions. Preliminary PCR reactions using 
Dynazyme as the DNA polymerase were set up to detemnine the correct amount of 
template using the PCR primers for the internal control (histone). The results of 
25 this preliminary PCR were used to set up another round of PCR to determine the 
optimum number of cycles for linear amplification. The final PCR amplifications 
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were performed using the following cycling conditions: 94 ^C, 4 min., 1 time; 94 ^C, 
15 sec, 60 ^C, 30 sec, 72 ^C, 2 min., x times; 72 °C, 10 min., 1 time. The number 
of cycles in the amplification (x) was found to be dependent on the relative 
abundance of transcript and had to be optimised for each template. 

5 RT-PCR results using a white clover histone gene as an internal 

constitutively expressed control confirmed the tissue-specificity of two candidate 
ESTs to be root-prevalent (Figure 127 A and B). These were a phosphate 
transporter homolog (clone name 02wc1DsG07) and a root iron transporter 
homolog (clone name 05wc1 IsB1 1 ). 

10 A spotted white clover SAC library consisting of 50,304 clones with an 

estimated 99% genome coverage (6.3 genome equivalents) was screened using 
the phosphate transporter homolog EST nucleotide sequence as a probe. A 
number of positive BAG clones could be identified (Figure 128 A). After Southern 
hybridisation blotting (Figure 128 B) a 7.5 kb EcoRV genomic DNA fragment was 

15 selected and fully sequenced. The fragment contained the phosphate transporter 
homolog open reading frame and 4 kb of upstream sequence including the 
promoter region. A physical map of the genomic DNA fragment including the 
promoter region is shown in Figure 128 C. 

EXAMPLE 7 

20 Production by Agrobacter/um-medlated transformation and analysis of 
transgenic white clover plants carrying chimeric genes encoding CS, MDH 
and PEPC involved in organic acid biosynthesis 

A set of binary transformation vectors carrying chimeric white clover genes 
to alter the expression of the polypeptides Involved In organic acid biosynthesis to 
25 improve phosphorus acquisition efficiency as well as aluminium and acid soil 
tolerance in forage plants were produced as detailed in Examples 4 and 5. 

Agrobacterium-medlaied gene transfer experiments were performed using 
these transformation vectors. 

The production of transgenic white clover plants carrying the white clover 
30 TrCSa, TrCSb, TrCSd, TrMDH and TrPEPC cDNAs, either singly or in 
combination, is described here in detail (Table 7). 
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Preparation of Agrobacterium 

Agrobacterium tumefaciens strain AGL-1 transformed with one of tlie binary 
vector constructs detailed in Example 6 were streaked on LB medium containing 
50 Mg/ml rifampicin and 50 yg/ml kanamycin and grown at 27 °C for 48 hours. A 
single colony was used to inoculate 5 ml of LB medium containing 50 \ig/m\ 
rifampicin and 50 \ig/m\ kanamycin and grown over night at 27 and 250 rpm on 
an orbital shaker. The overnight culture was used as an inoculum for 500 ml of LB 
medium containing 50 pg/ml kanamycin only, incubation was over night at 27 **C 
and 250 rpm on an orbital shaker In a 2 I Erienmeyer flask. 

Preparation of white clover seeds 

1 spoon of seeds (ca. 500) was placed into a 280 \im mesh size sieve and 
washed for 5 min under running tap water, taking care not to wash seeds out of 
sieve. In a laminar flow hood, seeds were transferred with the spoon Into an 
autoclaved 100 ml plastic culture vessel. A magnetic stirrer (wiped with 70% 
15 EtOH) and ca. 30 ml 70% EtOH were added, and the seeds were stirred for 5 min. 
Th^e EtOH was discarded and replaced by 50 ml 1 .5% sodium hypochlorite. The 
seeds were stirred for an additional 45 - 60 min on a magnetic stirrer. The sodium 
hypo\^hlorite was then discarded and the seeds rinsed 3 to 4 times with autoclaved 
ddHaO^. Finally 30 ml of ddHaO were added, and seeds incubated over night at 10 - 
20 15''C in an Incubator. 

>^grobacfer/i//n-mediated transformation of white clover 

The seed coat and endosperm layer of the white clover seeds prepared as 
above were removed with a pair of 18 G or 21 G needles. The cotyledons were cut 
from the hypocotyl leaving a ca, 1.5 mm piece of the cotyledon stalk. The 
25 cotyledons were transferred to a petridish containing ddHaO. After finishing the 
Isolation of clover cotyledons, ddHaO in the petridish was replaced with 
Agrobacterium\^uspens\on (diluted to an ODeoo = 0.2 - 0.4). The petridish was 
sealed with Its lid and Incubated for 40 min at room temperature. 

After the irl^ubatlon period, each cotyledon was transfenred to paper towel 
30 using the small dissection needle, dried and plated onto regeneration medium 
RM73, The plates wWe incubated at 25°C with a 16h light/8h dark photoperiod. On 
day 4, the explants were transferred to fresh regeneration medium. Cotyledons 
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transformed with Agrobacterium were transferred to RM73 containing cefotaxime 
(antibacterial agent) and gentamycin. Tlie dislies were sealed with Parafilm and 
incubated at under a 16/8 h photoperiod. Explants were subcultured every 
three weeks for a total of nine weeks onto fresh RM 73 containing cefotaxime and 
5 gentamycin. Shoots with a green base were then transferred to root-inducing 
medium RIM. Roots developed after 1-3 weeks, and plantlets were transferred to 
soil when the roots were well established. 

Preparation of genomic DNA for reai-time PCR and anaiysis for the presence 
of transgenes 

10 3-4 leaves of white clover plants regenerated on selective medium were 

harvested and freeze-dried. The tissue was homogenised on a Retsch MM300 
mixer mill, then centrifuged for 10 min at 1700xg to collect cell debris. Genomic 
DNA was isolated from the supernatant using Wizard Magnetic 96 DNA Plant 
System kits (Promega) on a Biomek FX (Beckman Coulter). 5 pi of the sample (50 

15 pi) were then analysed on an agarose gel to check the yield and the quality of the 
genomic DNA. 

Genomic DNA was analysed for the presence of the transgene by real-time 
PCR using SYBR Green chemistry. PCR primer pairs were designed using 
MacVector (Accelrys) or PrimerExpress (ABI). The forward primer was located 
20 within the 35S^ promoter region and the reverse primer within the transgene to 
amplify products of approximately 150 - 250 bp as recommended. The positioning 
of the forward primer within the 35S^ promoter region guaranteed that endogenous 
genes in white clover were not detected. 

5 pi of each genomic DNA sample was run in a 50 pi PCR reaction 
25 including SYBR Green on an ABI 7700 (Applied Biosystems) together with 
samples containing DNA isolated from wild type white clover plants (negative 
control), samples containing buffer Instead of DNA (buffer control) and samples 
containing the plasmid used for transformation (positive plasmid control). Cycling 
conditions used were 2 mIn. at 50 °C, 10 min. at 95 ®C and then 40 cycles of 15 
30 sec. at 95 ^'C, 1 min. at 60 ^C. 
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Preparation of genomic DNA and analysis of DNA for presence and copy 
number of transgene by Southern hybridisation blotting 

Genomic DNA for Southern hybridisation blotting was obtained from leaf 
material of white clover plants following the CTAB method. Southern hybridisation 
5 blotting experiments were performed following standard protocols as described in 
Sambrook et al. (1989). In brief, genomic DNA samples were digested with 
appropriate restriction enzymes and the resulting fragments separated on an 
agarose gel. After transfer to a membrane, a cDNA fragment representing a 
transgene or selectable marker gene was used to probe the size-fractionated DNA 
10 fragments. Hybridisation was peri'onmed with either radioactively labelled probes or 
using the non-radioactive DIG labelling and hybridisation protocol (Boehringer) 
following the manufacturer's instructions. 

Plants were obtained after transformation with all chimeric constructs and 
selection on medium containing gentamycin. Details of plant analysis are given in 
1 5 Table 5 and Figures 1 30, 1 31 and 1 32. 
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